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DETAILED ACTION 
Response to Amendment 

1 . The affidavit filed on May 2, 2005 under 37 CFR 1 .131 has been considered but 
is ineffective to overcome the Walker (U.S. Patent Application Publication 
2003/0050777) reference. 

The evidence submitted is insufficient to establish diligence of attorney in the 
preparing and filing of the patent application as per M.P.E.P. 2138.06. A draft 
application was prepared by June 1 1 , 2001 . A final application was forwarded to the 
inventors on December 28, 2001 , over six months after the preparation of the initial draft 
application. The application was filed on January 29, 2002. The Applicant has asserted 
that draft applications were diligently worked on in the time period between June 1 1 , 
2001 and December 28, 2001 ; however, there is no evidence of record to substantiate 
that claim. For example, there is no evidence of draft application revisions between 
June 1 1 , 2001 and December 28, 2001 , nor is there evidence of a backlog of unrelated 
cases that the attorney may have had. Therefore, the Applicant has failed to show 
reasonable diligence on behalf of the attorney in preparing and filing the patent 
application and the affidavit has been considered ineffective to overcome the Walker 
reference. 

Accordingly, the rejections made in the previous Office Action stand. 
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2. Furthermore, with regard to the use of official notice in the rejections of claims 4, 
11, 12, and 17, it is noted that the applicant has not made any attempt to traverse the 
assertion of official notice, therefore the well-known in the art statement is taken to be 
admitted prior art (see MPEP 2144.03) 

Specification 

3. The amendments to the specification overcome the objections made in the 
previous Office Action. The objections to the specification are withdrawn. 

Drawings 

4. The amendments to the drawings overcome the objections made in the previous 
Office Action. The objections to the drawings are withdrawn. 

Claim Objections 

5. The amendments to the claims overcome the objections made in the previous 
Office Action. The objections to the claims are withdrawn. 

Claim Rejections - 35 USC § 102 

6. The following is a quotation of the appropriate paragraphs of 35 U.S.C. 102 that 
form the basis for the rejections under this section made in this Office action: 

A person shall be entitled to a patent unless - 

(e) the invention was described in (1) an application for patent, published under section 122(b), by 
another filed in the United States before the invention by the applicant for patent or (2) a patent 
granted on an application for patent by another filed in the United States before the invention by the 
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applicant for patent, except that an international application filed under the treaty defined in section 
351 (a) shall have the effects for purposes of this subsection of an application filed in the United States 
only if the international application designated the United States and was published under Article 21 (2) 
of such treaty in the English language. 

7. Claims 14 and 15 are rejected under 35 U.S.C. 102(e) as being anticipated by 
Walker (U.S. Patent Application Publication 2003/0050777). 

In regard to claim 14, Walker discloses a system for integrating acoustic data 
using speech recognition (Fig. 1,10), comprising: 

a communication module (server 38) which receives voice data from a plurality of 
computers each having speech recognition residing thereon (personal computers 16, 
22, 28, and 34 with respective speech recognizers 12, 18, 24, and 30), the 
communication module residing on the plurality of computers or a remote server (page 
1, paragraph 11, lines 1-5 and page 2, paragraph 12) 

an evaluator module associated with each of the plurality of computers, the 
evaluator module analyzes the voice data from each of the plurality of computers 
(transcription service 36 analyzes the for user information and time stamps of each 
entry, page 1 , paragraph 1 1 , lines 6-9); and 

an integrator module associated with the evaluator module, the integrator module 
integrates all of the analyzed voice data from each of the plurality of computers and 
provides one decoding output (the transcription entries are arranged as an ordered and 
interleaved transcription of the plurality of computers, page 1 , paragraph 1 1 , line 9 
through page 2, 1 st column, line 3). 



In regard to claim 15, Walker discloses: 
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the voice data is associated with at least two master speakers associated with 
the speech recognition associated with different computers of the plurality of computers 
(speech recognizers 12, 18, 24, and 30 are associated with particular first through fourth 
persons, respectively, page 1, paragraph 8, lines 9-12 and paragraph 10, lines 4-14); 
and 

the integrator module integrates the voice data of the at least two master 
speakers into the one decoding output (the conversation of several persons is 
interleaved by transcription service 36, page 2, 1 st column, lines 1-3 and paragraph 13, 
lines 3-11). 

Claim Rejections - 35 USC § 103 

8. The following is a quotation of 35 U.S.C. 1 03(a) which forms the basis for all 
obviousness rejections set forth in this Office action: 

(a) A patent may not be obtained though the invention is not identically disclosed or described as set 
forth in section 1 02 of this title, if the differences between the subject matter sought to be patented and 
the prior art are such that the subject matter as a whole would have been obvious at the time the 
invention was made to a person having ordinary skill' in the art to which said subject matter pertains. 
Patentability shall not be negatived by the manner in which the invention was made. 

9. Claims 1-13, 16-17, and 19-20 are rejected under 35 U.S.C. 103(a) as being 
unpatentable over Walker, in view of Bennett et al. (U.S. Patent 6,701,293). Claims 4, 
11, 12, and 17 are rejected under 35 U.S.C. 103(a) as being unpatentable over Walker, 
in view of Bennett et al., and further in view of the applicant's admitted prior art. 
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In regard to claims 1 and 19, Walker discloses a method and a machine readable 
medium containing code for integrating acoustic data using speech recognition (Fig. 2), 
comprising the steps of: 

detecting voice data on a first computer (voice data from a first user is detected 
by first speech recognizer on a first computer, step 100, page 2, paragraph 15, lines 4- 
7); 

identifying the voice data as a first master speaker associated with a speech 
recognition system residing on the first computer (step 100, the speech recognizer 
recognizes the voice data as coming from person #1 , page 2, paragraph 15, lines 4-7); 

analyzing the voice data residing on the first computer (step 102, the speech 
recognizer converts the utterance into a dictation including text, page 2, paragraph 15, 
lines 7-11); and 

integrating the analyzed voice data from the first computer into a single decoding 
output (step 106, results from a plurality of speech recognizers are integrated into an 
interleaved transcript, page 2, 2 nd column, lines 6-9). 

Walker further discloses detecting a second voice data on a second computer 
and analyzing and integrating that second voice data into the output transcript (see Fig. 
2). 

Walker does not disclose providing the first voice data from the first computer to 
the at least second computer having a speech recognition system thereon and 
recognizing the first voice data in parallel on both the first and at least second 
computers. 
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Bennett et al. disclose a method for recognizing speech (Fig. 3) that analyzes 
voice data (input stream) on a plurality of speech recognizers and integrates the results 
of those recognizers into a single output (column 3, lines 51-53, and column 4, lines 20- 
21). 

It would have been obvious to one of ordinary skill in the art at the time of 
invention to modify Walker to provide a first voice data to each of the speech 
recognizers 12, 18, 24, and 30 in parallel, and then combine the results of those 
recognizers into a single decoding output, as taught by Bennet et al., in order to 
increase the recognition accuracy of the overall system and provide a more accurate 
transcript of the voice data. 

In regard to claims 2 and 20, Walker discloses: 

detecting a second voice data on at least the second computer (voice data from 
a second user is detected by a second speech recognizer on a second computer, page 
2, paragraph 16, lines 3-7); 

identifying the second voice data as being a second master speaker associated 
with the speech recognition system of the at least the second computer (step 1 08, voice 
data is recognized as coming from person #2, page 2, paragraph 16, lines 3-7); 

analyzing the second voice data residing on the at least the second computer 
(step 110, the speech, recognizer converts the utterance into a dictation including text, 
page 2, paragraph 16, lines 7-10); and 
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integrating the analyzed voice data and the analyzed second voice data into the 
single decoding output (step 106, results from a plurality of speech recognizers are 
integrated into an interleaved transcript, page 2, paragraph 26, lines 12-17). 

Walker does not disclose providing the second voice data from the at least 
second computer to the first computer and recognizing the first voice data in parallel on 
both the first and at least second computers. 

Bennett et al. disclose a method for recognizing speech (Fig. 3) that analyzes* 
voice data (input stream) on a plurality of speech recognizers and integrates the results 
of those recognizers into a single output (column 3, lines 51-53, and column 4, lines 20- 
21). 

It would have been obvious to one of ordinary skill in the art at the time of 
invention to modify Walker to provide a second voice data to each of the speech 
recognizers 12, 18, 24, and 30 in parallel, and then combine the results of those 
recognizers into a single decoding output, as taught by Bennet et al., in order to 
increase the recognition accuracy of the overall system and provide a more accurate 
transcript of the voice data. 

In regard to claim 3, Walker discloses the at least second computer is a second 
and third computer (Fig. 1, first computer 16, second computer 22, and third computer 
38). 
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In regard to claim 4, Walker discloses each computer is associated with a person 
(speech recognizers 12, 18, 24, and 30 are associated with particular first through fourth 
persons, respectively, page 1, paragraph 8, lines 9-12 and paragraph 10, lines 4-14). 
Furthermore, Walker discloses that each recognizer identifies when its particular person 
is speaking (see, for example page 2, paragraph 15, lines 4-7). In Fig. 1, the audio 
associated with each person is shown closest to the respectively associated speech 
recognizer. 

Walker is silent as to the details of how the first and second speakers are 
identified. 

The applicant's admitted prior art discloses that the volume of sound input 
received by a microphone increases dramatically when somebody is speaking into it. 

It would have been obvious to one of ordinary skill in the art at the time of 
invention to modify Walker et al. to perform the step of detecting who was speaking by 
determining when a volume was higher than a predetermined background noise level 
threshold on that user's respective computer, because determining who was speaking 
with a simple volume threshold comparison would greatly reduce the amount of 
processing needed to determine who was speaking. 

In regard to claim 5, Walker discloses: 

summarizing the analyzed voice data and the second voice data into a single 
transcript (the transcription entries are arranged as an ordered and interleaved 
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transcription of the plurality of computers, page 1 , paragraph 1 1 , line 9 through page 2, 
1 st column, line 3). 

In regard to claims 6 and 7, Walker does not disclose the analyzed voice data is 
weighted. 

Bennet et al. disclose weighting the analyzed voice data differently for each of 
the plurality of recognizers (column 6, lines 7-16). The higher weight of the analyzed 
voice data would necessarily be selected as the most accurate rendition of the voice 
data. 

It would have been obvious to one of ordinary skill in the art at the time of 
invention to further modify Walker to weight the analyses of the first voice data and 
second voice data according to a weight assigned to the first recognizer and a weight 
assigned to the second recognizer, so that poor recognition results from one of the 
recognizers would not overly influence the final recognition result, thereby increasing 
the accuracy of the overall system. 

In regard to claim 8, Walker does not disclose providing a confidence level for 
each word. 

Bennett et al. disclose providing a confidence level for each recognition result 
generated by each recognizer (column 5, lines 6-8). The confidence level is used in the 
integration of the results of each recognizer (column 5, lines 10-14). 
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It would have been obvious to one of ordinary skill in the art at the time of 
invention to further modify Walker et al. to provide a confidence level to each word 
associated with both the first voice data and second voice data, so that at the 
integration step, the recognizer with the highest confidence level would be selected as 
the correct result, as taught by Bennet et al. This would ensure that the best recognition 
result was included in the transcript, thereby increasing the accuracy of the transcript 
and reducing the need for later editing of the transcript. 

In regard to claim 9, the connections disclosed by Walker between the first 
computer and at least second computer (arrows in Fig. 1) must necessarily 
communicate in a wire or wireless communication protocol. 

In regard to claim 10, Walker discloses: 

the speech recognition of the first computer and the at least the second computer 
are one of (i) a same speech recognition system and (ii) a different speech recognition 
system (any number of speech recognition software applications are used for speech 
recognizers 12, 18, 24, and 30, page 2, paragraph 14, lines 1-3); and 

the first master speaker and the second master speaker are further associated 
with the speech recognition of the at least the second computer and the first computer, 
respectively (speech recognizers 12, 18, 24, and 30 are associated with particular first 
through fourth persons, respectively, page 1, paragraph 8, lines 9-12 and paragraph 10, 
lines 4-14). 
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In regard to claim 11, neither Walker nor Bennett et al. specifically disclose 
filtering out the background noise. 

The applicant's admitted prior art discloses filtering out background noise, since 
background noise included in the speech data drastically reduces a speech recognizer's 
ability to accurately recognize speech. 

It would have been obvious to one of ordinary skill in the art at the time of 
invention to further modify the combination of Walker and Bennett et al. to filter out 
background noise, since background noise included in the speech data drastically 
reduces a speech recognizer's ability to accurately recognize speech. 

In regard to claim 12, neither Walker nor Bennett et al. disclose providing 
feedback to the first computer and at least second computer relating to the performance 
of the recognizers or maintaining a record of credibility. 

The applicant's admitted prior art discloses providing feedback relating to the 
performance analysis of a speech recognizer so that the performance recognizer could ' 
be improved. Furthermore, it is notoriously well known and recognized in the art to 
maintain a record of credibility relating to the ability to recognize a master speaker. 

It would have been obvious to one of ordinary skill in the art at the time of 
invention to further modify the combination of Walker and Bennet et al. to track the 
speech recognition performance (the accuracy of determining what was being said) as 
well as the voice recognition performance (the accuracy of determining who was talking) 
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so that both could be continuously adapted to increase the accuracy of both using any 
well known adaptation method. 

In regard to claim 16, Walker does not disclose providing a confidence level for 
each word. 

Bennett et al. disclose providing a confidence level for each recognition result 
generated by each recognizer (column 5, lines 6-8). The confidence level is used in the 
integration of the results of each recognizer (column 5, lines 10-14). 

It would have been obvious to one of ordinary skill in the art at the time of 
invention to further modify Walker et al. to provide a confidence level to each word 
associated with both the first voice data and second voice data, so that at the 
integration step, the recognizer with the highest confidence level would be selected as 
the correct result, as taught by Bennet et al. This would ensure that the best recognition 
result was included in the transcript, thereby increasing the accuracy of the transcript. 

In regard to claim 17, Walker discloses each computer is associated with a 
person (speech recognizers 12, 18, 24, and 30 are associated with particular first 
through fourth persons, respectively, page 1, paragraph 8, lines 9-12 and paragraph 10, 
lines 4-14). Furthermore, Walker discloses that each recognizer identifies when its 
particular person is speaking (see, for example page 2, paragraph 15, lines 4-7). In Fig. 
1 , the audio associated with each person is shown closest to the respectively 
associated speech recognizer. 
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Walker is silent as to the details of how the first and second speakers are 
identified. 

The applicant's admitted prior art discloses that the volume of sound input 
received by a microphone increases dramatically when somebody is speaking into it. 

It would have been obvious to one of ordinary skill in the art at the time of 
invention to modify Walker et al. to perform the step of detecting who was speaking by 
determining when a volume was higher than a predetermined background noise level 
threshold on that user's respective computer, because determining who was speaking 
with a simple volume threshold comparison would greatly reduce the amount of 
processing needed to determine who was speaking. 

Claim 18 is rejected under 35 U.S.C. 103(a) as being unpatentable over Walker, 
in view of Saito et al. (European Patent Application 1 061 724). 
Walker discloses: 

a final decoder output module associated with the integrator module, the final 
decoder output module prepares a summary of the decoded output; and 

summurator module for receiving the summary of the decoded output (the 
conversation of several persons is interleaved and stored by transcription service 36, 
page 2, 1 st column, lines 1-3 and paragraph 13, lines 3-11). 

Walker does not disclose a sender module for sending the decoded output to a 
computer of the plurality of computers for transcription or editing the decoded output. 
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Saito et al. disclose a sender module for sending the decoded output to a 
computer of the plurality of computers for transcription or editing the decoded output 
(image display means 15, correction input mean 16, and correction executing means 17 
function to send the decoded output to users for correction and editing, page 10, 
paragraph 52 and page 1 1 , paragraphs 53 and 54). 

It would have been obvious to one of ordinary skill in the art at the time of 
invention to modify Walker to include a sender module so that any misrecognized words 
could be corrected at the consent of all the users who's voice was transcribed, as taught 
by Saito et al. (column 14, lines 6-10). 

Conclusion 

10. THIS ACTION IS MADE FINAL. Applicant is reminded of the extension of time 
policy as set forth in 37 CFR 1.136(a). 

A shortened statutory period for reply to this final action is set to expire THREE 
MONTHS from the mailing date of this action. In the event a first reply is filed within 
TWO MONTHS of the mailing date of this final action and the advisory action is not 
mailed until after the end of the THREE-MONTH shortened statutory period, then the 
shortened statutory period will expire on the date the advisory action is mailed, and any 
extension fee pursuant to 37 CFR 1 .1 36(a) will be calculated from the mailing date of 
the advisory action. In no event, however, will the statutory period for reply expire later 
than SIX MONTHS from the mailing date of this final action. 
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1 1 . Any inquiry concerning this communication or earlier communications from the 
examiner should be directed to Brian L. Albertalli whose telephone number is (571) 272- 
7616. The examiner can normally be reached on Mon - Fri, 8:00 AM - 5:30 PM, every 
second Fri off. 

If attempts to reach the examiner by telephone are unsuccessful, the examiner's 
supervisor, Wayne Young can be reached on (571 ) 272-7582. The fax phone number 
for the organization where this application or proceeding is assigned is 703-872-9306. 

Information regarding the status of an application may be obtained from the 
Patent Application Information Retrieval (PAIR) system. Status information for 
published applications may be obtained from either Private PAIR or Public PAIR. 
Status information for unpublished applications is available through Private PAIR only. 
For more information about the PAIR system, see http://pair-direct.uspto.gov. Should 
you have questions on access to the Private PAIR system, contact the Electronic 
Business Center (EBC) at 866-217-9197 (toll-free). 

BLA 7/14/05 




